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ABSTRACT 

We predict the space density of molecular gas reservoirs in the Universe, and place a lower limit on the 
number counts of carbon monoxide (CO), hydrogen cyanide (HCN) molecular and [C II] atomic emission 
lines in blind redshift surveys in the submillimeter-centimeter spectral regime. Our model uses: (a) recently 
available HCN Spectral Line Energy Distributions (SLEDs) of local Luminous Infrared Galaxies (LIRGs, 
Lir > 10 11 L Q ), (b) a value for e i =SFR/Md cnS c(H2) provided by new developments in the study of star 
formation feedback on the interstellar medium and (c) a model for the evolution of the infrared luminosity den- 
sity. Minimal 'emergent' CO SLEDs from the dense gas reservoirs expected in all star-forming systems in the 
Universe are then computed from the HCN SLEDs since warm, HCN-bright gas will necessarily be CO-bright, 
with the dense star-forming gas phase setting an obvious minimum to the total molecular gas mass of any 
star-forming galaxy. We include [C II] as the most important of the far-infrared cooling lines. Optimal blind 
surveys with the Atacama Large Millimeter Array (ALMA) could potentially detect very distant (z ~ 10-12) 
[C II] emitters in the >ULIRG galaxy class at a rate of ~0. 1-1 per hour (although this prediction is strongly de- 
pendent on the star formation and enrichment history at this early epoch), whereas the (high-frequency) Square 
Kilometer Array (SKA) will be capable of blindly detecting z > 3 low-/ CO emitters at a rate of ^40-70 per 
hour. The [C II] line holds special promise for the detection of metal-poor systems with extensive reservoirs of 
CO-dark molecular gas where detection rates with ALMA can reach up to 2-7 per hour in Bands 4-6. 

Subject headings: galaxies: ISM — galaxies: starburst — galaxies: evolution — cosmology: observations — 
ISM: molecules: CO, HCN 



1. INTRODUCTION 

Since the first detections of the J=l— >0 rotational tran- 
sition of 12 CO and some of its isotopologues in Galactic 
molecular clouds ( 13 CO, C ls O) (Wilson et al. 1970; Pen- 
zias et al. 1971, 1972), and in galactic nuclei (Rickard et 
al. 1975) there have been many studies of CO line emission 
in galaxies using single dish radio telescopes and interfer- 
ometer arrays (for reviews see Young & Scoville 1991 and 
Solomon & Vanden Bout 2005). Multi-/ CO line ratio sur- 
veys are now routinely used to assess the state of the molec- 
ular gas in galaxies (e.g. Braine & Combes 1992; Aalto et 
al. 1995; Papadopoulos & Seaquist 1998; Mauersberger et 
al. 1999; Nieten et al. 1999; Yao et al. 2003; Mao et al. 
201 1) over the density regime where most of its mass resides 
(n ~ 10 2 ~ 3 cm -3 ). The fainter molecular line emission from 
heavy-rotor molecules such as HCN have also become a stan- 
dard tool for assesing the state and the mass of the denser 
(>10 4 cm -3 ) gas phase where stars actually form in Giant 
Molecular Clouds (GMCs, e.g. Nguyen-Q-Rieu et al. 1989; 
Solomon et al. 1992a; Paglione et al. 1995, 1997; Jackson et 
al. 1995). The role of the latter phase as the direct fuel of star 
formation in individual GMCs, quiescent disks and merger- 
driven spectacular starbursts in the local and distant Universe 
is now well established over an astounding 7-8 orders of mag- 
nitude (Gao & Solomon 2004; Wu et al. 2005; Juneau et al. 
2009; Wang etal.2011). 

In the past decade, numerous high-z detections have re- 
vealed the fundamental role of molecular lines in assessing 
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the state and mass of the molecular gas, and the dynamical 
mass of heavily dust-enshrouded galaxies in the early Uni- 
verse (Solomon & Vanden Bout 2005 and references therein), 
and in some cases have provided remarkable insights into 
the properties of the molecular interstellar medium (ISM) 
in early galaxies (see Danielson et al. 2010 for a recent ex- 
ample of a well sampled CO spectral line energy distribu- 
tion (SLED) in a z — 2.3 gravitationally lensed galaxy). 
This exploration began with the first detection of CO /(3— >2) 
line emission in the strongly-lensed distant dust-enshrouded 
galaxy IRAS 10214+4724 at z ~ 2.3 (Brown & Vanden 
Bout 1991; Solomon et al. 1992b). It continued with the 
detection of CO transitions in distant submm-bright galaxies 
(SMGs, L m > 10 12 L©) (Frayer et al. 1998, 1999; Greve et 
al. 2005) and is increasingly encompassing less extreme, but 
still massive systems such as Lyman Break galaxies (Baker et 
al. 2004), optical/near-infrared selected galaxies at z ~ 1.5 
(Dannerbauer et al. 2009; Daddi et al. 2010) and fortuitously 
lensed systems (Danielson et al. 201 1; Lupu et al. 201 1). Sev- 
eral spectacular CO line detections have also been obtained 
also for other high-redshift systems such as radio galaxies 
(e.g. De Breuck et al. 2005) and QSOs out to z ~ 6.4. This 
epoch is close to the era of reionization - the final frontier of 
galaxy evolution studies - revealing the gas-rich hosts to rapid 
galaxy growth at these early times (Walter et al. 2003, 2004; 
Weiss et al. 2007). 

These discoveries and advancements, made possible as 
sensitivities of millimeter/submillimeter interferometer arrays 
improved in the last decade, still yield only a glimpse of 
what will be a new era where molecular and atomic (e.g. 
[Cll] A158, [Cl]( 3 Pi^ 3 P ) ) ISM lines will replace nebular 
optical/near-infrared (OIR) lines as the main tool of choice 
for discerning galaxy formation and evolution across the full 
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span of cosmic time pertinent to galaxy growth, from the end 
of the reionisation epoch (z > 7) to the present (Walter & 
Carffli 2008). 

Direct 'blind' searches of gas-rich galaxies using submm- 
cm wave molecular and atomic lines are the only tool that 
can: (i) uniformly select galaxies according to their molecular 
gas content rather than their SFR and the star formation effi- 
ciency (a bias that has so far - necessarily - affected all high-z 
gas studies), (ii) immediately provides redshifts and eventu- 
ally dynamical mass information, (iii) holds the promise of 
discovering large outliers of the local £sfr-£h 2 (Schmidt- 
Kennicutt) relations (Kennicutt 1998), with large reservoirs of 
molecular gas but low levels of SFR (Papadopoulos & Pelu- 
pessy 2010), and (iv) can possibly determine the star forma- 
tion 'mode' (starburst/merger-driven versus quiescent/disk- 
like), in a uniform and extinction-free manner (Paper II, Pa- 
padopoulos & Geach 2012). 

Well-sampled CO spectral line energy distributions, and 
their robust normalization by some observable galaxy prop- 
erty are necessary for predicting the emergent CO line lumi- 
nosities in star-forming systems. The lack of these two key 
ingredients translates to major uncertainties for the source 
counts predicted for blank-field cm/mm/submm molecular 
line surveys (Combes et al. 1999; Blain et al. 2000; Carilli 
& Blain 2002), as well as the frequency and flux range where 
such surveys become optimal (Blain et al. 2000). The dense 
gas phase (n(H2)>10 cm" 3 ) and the linear relation of its 
mass to the SFR in individual GMCs (L m -10 4 - 5 L ), ul- 
traluminous infrared galaxies (ULIRGs, Lir^IO 12 Lq) and 
high-redshift extreme starbursts (HLIRGs, Lir^IO L q ) 
makes it an obvious benchmark for computing minimum 
emergent molecular line luminosities. Indeed only this gas 
phase yields physically meaningful estimates of the so-called 
star formation efficiency (and its equivalent interpretation 
in terms of gas consumption timescales) while the HCN- 
deduced (and thus well-excited CO SLEDs) contain minimal 
uncertainties up to high-/ rotational transitions of CO. The 
dense, HCN -bright, molecular gas phase in galaxies is thus an 
obvious ingredient of any theoretical models for blank-field 
cm/mm/submm molecular line surveys. 

2. OBJECTIVES OF THIS WORK 

In this work we compute the number counts of the 
star-forming molecular gas reservoirs in the Universe us- 
ing: (i) hydrogen cyanide (HCN) SLEDs of local lumi- 
nous infrared galaxies (LIRGs, Lir/^10 11 Lq), and (ii) an 
e^=SFR/A/d cnse (H2) value provided by recent studies of star 
formation feedback on the interstellar medium (ISM). The 
latter is crucial for relating the cosmic SFR(z) to the dense 
gas mass necessary to fuel it. Minimal emergent CO SLEDs 
up to CO/(10— >9), normalized by the galaxy SFRs, can 
then be computed even from partial low-/ HCN SLEDs 
(Jup < 4) since warm HCN-bright gas will also be CO- 
bright (n cri t(HCN)/n cri t(CO)>100), while the dense HCN- 
bright star-forming gas mass sets an obvious minimum to the 
total molecular gas mass of star-forming systems. 

These SFR-normalized minimal CO SLEDs can then be 
used as inputs to various galaxy-evolution models to yield 
minimum source counts in blank-field cm/mm/submm molec- 
ular line surveys of star-forming systems. We use an em- 
pirically based phenomenological model for the evolution of 
the bolometric (IR) luminosity function that accurately re- 
produces the observed number counts of galaxies in several 
infrared and sub-millimeter surveys (Spitzer, Herschel and 



Emergent 12 CO luminosity per M(H 2 ) d ense= 10 9 M 




J I I I I I I I I L 



1 23456789 10 

Figure 1. Emergent 12 CO spectral line energy distributions for a gas mass 
ofM dense = JO 9 M ,forourvirial(X co = 9M Q (Kkms" 1 pc 2 )" 1 ) 
and super- virial (Xco — 3 Mq (Kkms _1 pc 2 ) _1 ) models. The solid 
line shows just the emergent flux for the dense phase, and the dashed line 
shows the total emission when the quiescent (i.e. cold SLED, Xco = 
5 Mq (Kkms -1 pc 2 )" 1 ) phase is included (see §3.2). This is the most 
conservative estimate for the molecular line emission from a star-forming 
galaxy, with the range in CO luminosity given by the two models reflect- 
ing the possible range in emission that would arise from different dynamic 
configurations of the cold ISM in active galaxies (§3.1). 

SCUBA) to predict the number counts of molecular line- 
emitting galaxies seen by the Atacama Large Millimeter Ar- 
ray (ALMA), the Jansky Very Large Array (JVLA) and the 
Square Kilometer Array (SKA) and its pathfinders. Using the 
shape of the integral numbers counts as a guide, we suggest 
the strategy for an optimal blind redshift survey (e.g. Blain 
et al. 2000; Carilli & Blain 2002) that could detect gas-rich 
galaxies across the full history of galaxy evolution. Through- 
out we assume a ACDM cosmological model, with fi ro = 0.3, 
Q\ = 0.7 and H = TOkms" 1 Mpc" 1 . 

3. THE MOLECULAR ISM MODEL: A MINIMALIST APPROACH 

3.1. The dense, star-forming gas phase 

Past studies used rather poorly constrained models of the 
molecular phase ISM and its CO line excitation range to 
derive the emergent line luminosities (e.g. Combes et al. 
1999; Blain et al. 2000). Here we make use of avail- 
able HCN J up < 4 SLEDs of LIRGs (Papadopoulos et 
al. 2007; Krips et al. 2008; Juneau et al. 2009) to safely 
extrapolate the corresponding emergent CO SLEDs from 
CO/(l— >0)to CO/(10->9)for the dense star-forming gas in 
galaxies. This is possible since even a modestly excited global 
HCN/(4^3)line emission (e.g. HCN (4 -» 3)/(l -> 0) = 
0.30) implies gas densities n(Ha) ~ 3x 10 4 — 10 5 cm -3 (e.g. 
Papadopoulos et al. 2007), similar to the critical density of 
CO/(9->8)(?i cr =2.2 x 10 5 cm~ 3 ). Furthermore, unlike past 
studies, we use our Large Velocity Gradient (LVG) radiative 
transfer code to constrain the properties of the dense gas phase 
using observed global HCN line ratios in LIRGs and ivT v ii ~ 1 
where 



Molecular and atomic line surveys of galaxies I 



3 




10 20 50 100 200 400 800 

Frequency (GHz) 

Figure 2. Schematic view of the visibility of the J up < 10 transitions of CO 
and HCN to z < 10, and the [C I]( 3 Pi-s> 3 Po) and [C II] fine structure lines, 
indicating the band coverage offered by the SKA high-frequency component 
and its pathfinder MeerKAT, as well as the radio bands X, U, K, Ka, Q and 
ALMA Bands 3-10. SKA, MeerKAT and JVLA offers the ability to detect 
low-/ CO and HCN emitting galaxies out to very high redshifts, whereas 
ALMA covers a broader range of transitions up to high-/ over a large span 
of cosmic time, but can access low-/ lines only out to z < 3. The vertical 
bars show the bandpasses of ground-based instrumentation that could access 
the FIR lines in the sub-mm regime (e.g. the Redshift and Early Universe 
Spectrometer [ZEUS], see Stacey et al. 2007). 
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parametrizes the average dynamic state of the corresponding 
gas, with K V i T ~l corresponding to virial gas motions (typical 
for dense HCN-bright star-forming cores in the Galaxy), and 
i^vir^l corresponding to unbound gas (typically found for 
low-/ CO line emission in LIRGs). The parameter a ~ 0.55- 
2.4 depends on the average cloud density profile (see Bryant 
& Scoville 1996), here we choose a = 1.5. The quantity 
Arcn = [HCN/H 2 ]/(dV/dR) is one of the three parameters 
defining the grid of a typical LVG code (the other two being 
Tkin and n(H 2 )). Furthermore we adopt Tkm^Tdust, which 
is certainly a good approximation for the dense gas phase 
where strong gas-dust thermal coupling sets in. Abundances 
of [HCN/H 2 ]=2 x 1CT 8 and [CO/H 2 ]=l(r 4 are adopted for 
our estimates of K VK for HCN and CO LVG solutions. 

Multi-7 HCN line surveys of LIRGs find average 
r J+ u(HCN) = HCN (J + 1 -> J)/ {I -> 0) brightness 
temperature ratios of r 2 i(HCN) ~ 0.65 and r 32 (HCN) ~ 
0.45 (Krips et al. 2008; Juneau et al. 2009). These can 
be higher still (up to ~1) for extreme mergers whose ISM 
is dominated by very dense gas (Greve et al. 2009), but 
the aforementioned ratios are good representatives of the av- 
erage values observed in LIRGs and we use them as con- 
straints on the properties of the HCN-bright star-forming 
dense gas phase. The LVG grid is run for temperatures of 
Tkin = (30 — 80) K fully encompassing the typical range ex- 



pected for star-forming gas and its concomitant dust. We find 
n(H 2 ) = 3 x 10 4 cm~ 3 , T kiu = 40 K, and K vil - 1 as well as 
n(H 2 ) = 10 5 cm" 3 , T kin = (30-35) K and 7T vir - 8 as typ- 
ical solutions. The virial solution is the most likely one for a 
dense star-forming gas phase in LIRGs (in current turbulence- 
regulated star-forming models of galaxies self-gravity over- 
comes turbulent gas motions at such high densities, Krumholz 
& McKee 2005). We nevertheless consider also the solution 
with A"vi r ~8, corresponding to gravitationally unbound gas 
motions; this 'super-virial' solution is possibly pertinent to 
the high density gas found in some extreme starbursts at high 
redshifts (Swinbank et al. 2011). Any additional excitation 
mechanisms such as hard X-rays from an active galactic nu- 
clei (AGN; Meijerink & Spaans 2005), and/or high cosmic 
ray energy densities due to the high supernovae rate densities 
expected in compact ULIRGs (Papadopoulos 2010) can only 
raise the dense gas temperatures leading to even more excited 
HCN and CO SLEDs. 

Finally we note that for simplicity we omit the enhanced 
cosmic microwave background (CMB) as it will have negli- 
gible effects on the dense and warm star-forming gas phase 
in galaxies, even at high redshifts. For a detailed discus- 
sion on this see Papadopoulos et al. 2000 (section 4.2), but 
this can be briefly demonstrated by assuming Tki n = 40 K 
and Tki rl ~ Td UB t, typical for the dense, HCN-bright, star- 
forming gas in LIRGs. The thermodynamically equivalent 
T(z) = Tdust^) ~ Tkin (z) of that phase at z = 5wouldthen 

be: T(z) = (T(0f -T CMB (0f +T C mb(z) 6 ) 1/6 « 40 K, 
i.e. it remains essentially identical (a dust emissivity law of 
a = 2 is assumed). The impact can be more significant 
for CO line ratios of a cold and quiescent gas phase (for a 
Tkin(0) = 15 K, T kin (z = 5) = 17.56K) where these ratios 
can be boosted somewhat by the enhanced CMB (see also Ta- 
ble 2 in Papadopoulos et al. 2000), while the line/CMB con- 
trast for the cold, quiescent H 2 phase diminishes. In that re- 
gard our assumed local CO line ratios for that phase (see fol- 
lowing section) remain conservative and in the spirit of our 
minimalist approach. 

Here we must note that any such global LVG solutions re- 
flect, at best, a rough average of the gas conditions responsible 
for the observed global line ratios considered. Indeed, as is the 
case for the so-called Xqo, AThcn, etc. factors that convert 
the CO and HCN J = 1 — > line luminosities to mass for 
the total and the dense molecular gas reservoirs respectively, 
these LVG solutions represent effective ensemble averages for 
large collections of molecular clouds, and thus appropriate for 
unresolved studies of high-z galaxies. In Table 1 we list the 
LVG-derived line ratios that correspond to the two types of 
LVG solutions found for the dense gas. These have very sim- 
ilar HCN line ratios up to J up = 5 (we do not extrapolate 
above this transition as data for only up to HCN 7(4— >-3) exist 
in the local Universe), and CO up to J up = 4. Above 
J up = 4 we give the range of acceptable values yielded by 
the LVG model. The corresponding -Xco.dense factors are: 
~9 Mq (Kkms -1 pc 2 ) -1 and -3 M Q (Kkms" 1 pc 2 )" 1 for 
the virial and the super-virial LVG solution respectively, and 
along with the computed line ratios (Table 1) are used to ob- 
tain the range of CO SLEDs for the dense gas shown in Fig- 
ure 1 . 

The computed CO SLEDs are minimal in two ways, 
namely: 

1 . Only a fraction of the total molecular gas mass in galax- 
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Figure 3. Predicted CO line fluxes for luminous infrared galaxies as a function of redshift (shown as contours of constant line flux with units of log(_F/W m~ 2 ) 
labelled) from our emergent model. Bold lines show the 'viriaT model prediction, and grey lines show the equivalent 'super-virial' model. Given the sensitivities 
of submm/mm/cm instrumentation, this model can be used as a conservative estimate of the time required to detect a CO emission line from star-forming 
(infrared-bright) galaxies across cosmic time. This will be valuable in follow-up spectroscopy of submm continuum surveys to be conducted with, e.g. JCMT, 
LMT & CCAT that will track LIRG-ULIRG level activity out to z ~ 5. 



ies belongs to this dense star-forming gas phase con- 
tributing to such SLEDs at any given moment of their 
evolution. 

2. The observed global HCN line ratios used for 
their derivation inadvertently include some non-star- 
forming, less dense gas diluting the HCN ratios from 
those typical for the star-forming phase alone. 

The HCN SLEDs are considered only up to J up = 5 since: 

(a) available HCN line data for LIRGs in the local Universe 
exist only up to J up = 4 (e.g. Papadopoulos 2007), and 

(b) only HCN lines up to J up — 3 were used in our LVG 
model (thus interpolation beyond J up = 5 is not safe). We 
also note that the normalization of the HCN SLED, namely 
the X H cN=M denS e(H 2 )/LHCN( 1 °) factor is not iden- 
tical to that of CO SLEDs since the HCN /(l-»0) is not 
fully thermalized (unlike CO 7(1— >0)) even at the high den- 
sities of our LVG solutions. The latter yield Xhcn={9, 
27} M Q (Kkms -1 pc 2 ) -1 for the unbound (i.e. super-virial) 
and the virial solution respectively. 

The total CO SLED of a star-forming galaxy will also in- 
clude non-star-forming gas in a much more tenuous phase. 
This typically contains the bulk of the molecular gas in in- 
dividual GMCs, and the molecular gas reservoirs in local 
LIRGs (except perhaps in ULIRGs). Such quiescent, cool 
gas at lower densities can thus contribute significantly to the 
velocity-integrated flux densities of the CO J up < 3 lines (as 
well as to the [C l]( 3 Fi— > 3 Po) li ne of atomic carbon) increas- 
ing the source counts in line surveys that would include those 
transitions. Therefore, in addition to the emergent CO line 
flux computed for the HCN-bright dense star-forming phase 
we also include an estimate for the low-/ CO emission from 
this cold, quiescent phase. 

3.2. The cold, quiescent gas phase 

In the local Universe the star formation 'mode' seems to 
be very well (i.e. uniquely) indicated by the fraction of 
the total gas reservoir residing in the dense phase: £sf = 
-Mdense(H2)/M to t(H2). Local studies suggest typical values 



of £sf ~ 0.25 (and even >0.50) for merger-driven starburst 
systems (Solomon et al. 1992a; Gao & Solomon 2004; Pa- 
padopoulos et al. 2012) and £sf ~ 0.025 for secular activity 
in 'quiescent' systems, with the complete star-forming popu- 
lation potentially forming a continuum of £sf (cf. Daddi et 
al. 2010). Therefore, an estimate of the mass of the quiescent, 
non-star-forming gas phase can be calculated by: 

A/quiescent (H 2 ) = Md C nso(H2) ^ — - ^ SF ^ (2) 

In the spirit of our 'minimal' approach, the most conser- 
vative estimate of M qu i escent (H2) is given by assuming all 
galaxies are in the burst mode, with £sf = 0.25, such that 
Acescent (H 2 ) = 3 x M dense (H 2 ). Armed with this mass 
estimate, we can assign CO luminosities by assuming a fixed, 
representative SLED for this phase. 

We assume the 'coldest' known CO SLED in the local Uni- 
verse, which is found in cold quiescent clouds in M 3 1 with 
CO line ratios: r 2l = 0.42 ± 0.10, r 32 = 0.14 ± 0.04, 
#12/13 JO -+ °) = 12 CO/ 13 CO = 9 ± 2 (Allen et al. 1995; 
Loinard et al. 1995, 1996; Loinard & Allen 1998; Israel et 
al. 1998 and references therein). For the [C l]( 3 Pi^ 3 P ) line 
emission in the same environments nci]/co(l — 0) = 0.1 ± 
0.02 (Wilson 1997; Israel et al. 1998), while COBE deter- 
mined the [CI] (2 ->■ 1)/(1 ->■ 0) line ratio (difficult to ob- 
tain from the ground) in the similar environment of the outer 
Galaxy disk to be r [CI] (2 ->• 1/1 ->• 0) = 0.22 ± 0.09 
(Fixsen et al. 1999). We then use our LVG radiative trans- 
fer model, constrained by the available CO, 13 CO line ratios 
and i\~ V j r ^l to find the corresponding average ISM states, 
higher-/ CO line luminosities, and compute the Xco fac- 
tor normalizing the cold CO SLED in terms of H 2 molec- 
ular gas mass. We find typical virial LVG solutions with 
Tkin = (10 - 15) K, and X C o~5 A^Kkms- 1 pc 2 )" 1 (i.e. 
typical of cold GMCs in the Galaxy) and CO line ratios of 
r 2 i = 0.45-0.55, r 32 = 0.12 - 0.13, r 43 = 0.09 - 0.011 and 
^(.j+i)-j^;10 _4 for (J + 1)>4. We plot the additional cold 
component of the quiescent gas phase in Figure 1 . In Figure 2 
we present a schematic plot indicating the observed frequency 
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and atmospheric windows of the principle emission lines we 
consider in this work. 

3.3. Far-infrared cooling lines 

The dominant coolant for the warm (T < 1000 K) neutral 
ISM is the far-infrared (FIR) fine structure line [Cll] 158/im, 
carrying up to 1 % of the total FIR output. When redshifted 
beyond z > 1 this line will be an important contributor to 
detections in the submillimeter bands, and therefore important 
to consider in our model. 

We adopt an semi-empirical prescription for estimating 
the luminosity of [C II] based on the compendium of In- 
frared Space Observatory (ISO) Long Wavelength Spectrom- 
eter (LWS) observations of 227 local (cz < lO^ms" 1 ) 
galaxies by Brauher, Dale & Helou (2008). These authors 
note the scalings between the principle FIR fine structure 
line luminosities and FIR luminosity (here we consider the 
IRAS FIR luminosity to be equivalent to our assumed total in- 
frared luminosity) of galaxies, as well as the far-infrared color 
/so/Zioo (sensitive to the intensity of the underlying radiation 
field heating the dust; Stacey et al. 2010). For example, the ra- 
tio Liqu]/ Lpui drops from 1% to 0.1% with increasing (hot- 
ter) feo I /ioo • This could reflect a harder and more intense 
far-UV field that results in conditions less efficient at produc- 
ing C + (Malhotra et al. 2001), or an extra contribution to the 
FIR continuum from regions not associated with the PDRs 
(Luhman et al. 2003; Stacey et al. 2010). This phenomenon 
has become known as the [Cll] deficit, although it is not clear 
whether it is a true deficit. 

To evaluate the relevant scalings between line luminosities 
and IR luminosity, we only consider galaxies in the Brauher, 
Dale & Helou compendium with nuclear regions classed as 
'star-forming' that were unresolved by the LWS beam (D. 
Dale, private communication). Fitting a linear trend to the 
data, we find the scaling: 

log (L [cu] /L Fm ) = (-0.74±0.39) (/ 60 //ioo)-(2.08±0.25). 

(3) 

The uncertainties are the 1 a errors in the formal fit and re- 
flect the large scatter in the data. Although the strength of 
the [C II] line varies by a factor ^10, and can contribute up 
to <~1% of the FIR luminosity, in the spirit of our minimal- 
ist approach, we adopt a conservative scaling that predicts a 
modest emergent £[cii] for a galaxy with Lir. We therefore 

adopt /60//100 = 1, which yields log (£[cii]/£fir) = -2.8. 
While this value is a conservative estimate for local galax- 
ies, is it appropriate for (on average higher luminosity) high- 
redshift star-forming populations? Given the high luminosity 
of the line, there is a growing sample of [C II] detections in 
ULIRG-class systems. For example, Stacey et al. (2010) find 
a an average ratio of log (£[ CII ]/Xfir) = —2.5 for (star for- 
mation dominated) ULIRGs at 1 < z < 2; we therefore con- 
sider our canonical value based on the extreme tail of local 
star-forming galaxies suitable for application in our model. 

3.3.1. The [C II] line as tracer ofCO-dark H2 at high redshifts 

The presence of a CO-deficient and even CO-dark molec- 
ular gas reservoir is expected in globally metal-poor sys- 
tems such as local dwarf galaxies as well as Lyman-break 
galaxies (LBGs) at high redshifts, the result of efficient CO 
dissociation and a strongly self-shielding H 2 (e.g. Pak et 
al. 1998). Such a phase is even expected in typical local 



spiral disks at large galactocentric distances as a result of 
metallicity gradients (Papadopoulos et al. 2002). This very 
much inhibits the detection of such H2 gas reservoirs via the 
workhorse CO lines but at the same time anticipates very 
bright [C II] emission. In local dwarf galaxies the latter re- 
vealed ^10-100 times larger molecular gas mass than that 
inferred by CO (Madden et al. 1997). 

Similarly strong [C II] emission is expected for high-z sys- 
tems like LBGs which are very difficult to detect in CO lines 
(see Baker et al. 2004, Coppin et al. 2007 for the only such 
examples), while early evidence seems to corroborate this for 
other types of high-z systems (Maiolino et al. 2009). A gen- 
eral [C II] emission enhancement at high redshifts with respect 
to that expected from local template systems with similar in- 
frared luminosities can be a result of a general evolution- 
ary trend towards more metal-poor gas reservoirs at earlier 
epochs, and could much enhance the potential of the [C II] as 
a galaxy redshift survey tool. We caution though that its emis- 
sion, once detected, will be much harder than, for example, 
[C l]( 3 Pi— > 3 Po) to interpret solely in terms of total molecu- 
lar gas mass as the [C II] line will contain significant contri- 
butions from ionized and neutral hydrogen (e.g. Madden et 
al. 1997). These non-H 2 contributions to [Cll]line emission 
will be hard to correct for at high redshifts, where they may 
actually be boosted in metal-poor environments and/or strong 
average far-UV radiation fields of nascent starbursts. 

To explore these possibilities we assume a population of 
objects with L[ CII ]/Ltr = 10~ 2 , a value typical for the LMC 
and IC 10 (Madden et al. 1997). At such levels the [C II] line 
of LIRG-class systems can be detected right out to the epoch 
of re-ionisation (z <~ 10) with ALMA in bands 4-9, within 
8 hrs of full aperture synthesis observations (assuming one bin 
channel of 300 kms -1 , see §5.1). The evolutionary track of 
the SFR of such a putative galaxy population with a nearly 
CO-dark ISM is unknown, leaving the line number counts un- 
certain, however we discuss the potential for blind discovery 
of such systems assuming their abundance and evolution is 
identical to that of LBGs in §5. 1 . 1 . 

3.4. Calculating line luminosities 

For the various computations and transformations between 
line luminosity types, and for linking the latter to observed 
velocity-integrated line fluxes we use standard relations, 



L' x = [ [ T b , x dadV 

JAV J A, 



I AV J A 3 

r 2 



2k BK.rcst V 1 + z 



S„dV, 



(4) 



Ay 



where Tb, x is the rest-frame brightness temperature of the 
line, AV, A s are the line FWZI and source area respectively. 
After substituting astrophysical units this yields, 



L' = 3.25 x 10 7 



£>|(Mpc) 



1 + z 
Iav s * dv 



/ ^x,rest \ 

V GHz ) 



Jykms 



Kkms 



"V 2 , 



(5) 



where Dl is the luminosity distance, and ^ XirC st is the rest 
frame line frequency. The conversion to traditional luminosity 
units (Lq), used for the total line luminosities (L x = J L v dv) 
in CO SLEDs, can be made using 
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Table 1 

Normalized emergent CO and HCN SLEDs 



£SF,, 



Transition 


CO (dense) 


rj+l,.J a 

CO (quiescent) 


HCN 


l-s>0 


1.00 


1.00 


1.00 


2->l 


0.95 


0.50 


0.75 


3^2 


0.88 


0.13 


0.40 


4^3 


0.82 


0.09 


0.15 


5^4 


0.70-0.75 


10- 4 


0.05 


6^5 


0.64-0.70 






7^6 


0.53-0.60 






8^7 


0.31-0.42 






9^8 


0.10-0.20 






10^9 


0.01-0.04 








L j+i=j/ L 







_ 87rfc B ^ rcst 

J^ r. — o L„, 



= 3.18 x 10' 



V100 



1.00 GHz) 



L' 



lQSKkms-ipc 



Lr,. 



(6) 



Note that integrated line fluxes are related to the commonly 
used SAV integrated flux density units by: 



3 x 10- 4 \ 

x 

^obs GHz / 



10-26 Wm- 2 



SAV 
Jy km s~ 



- • (7) 



3.5. The normalization of dense molecular gas SLEDs 

The final remaining step for calculating the emergent 
molecular line emission from a star-forming galaxy is to nor- 
malize the SLEDs according to the infrared luminosity (i.e. 
SFR) of the system. Recent studies of star formation feedback 
suggest a maximum = Li R /M dense (H 2 ) ~ 5OO(L /M ) 
for the dense and warm gas Md en se(H 2 ) near star forming 
sites in galaxies as a result of strong radiation pressure from 
the nascent O, B star clusters onto the concomitant dust of the 
accreted gas fuelling these sites (Scoville 2004; Thompson 
et al. 2005; Thompson 2009). Thus, provided that average 
dust properties (e.g. its effective radiative absorption coeffi- 
cient per unit mass) remain similar in metal-rich star-forming 
systems such as LIRGs, a near-constant e* is expected for the 
dense star-forming gas. A value of ~ 500 (L /M ) is ac- 
tually measured in individual star-forming sites of spiral disks 
such as M 5 1 and entire starbursts such as Arp 220 (Scoville 
2004), while -(440 ± 100) (L /M ) is obtained for CS- 
bright star-forming cores in the Galaxy (Shirley et al. 2003). 
It must be noted however that the intermittency expected for 
galaxy-sized molecular gas reservoirs (i.e. at any given epoch 
of a galaxy's evolution some dense gas regions will be form- 
ing stars while others will not) can lower the global to — V3- 
V2 of the Eddington value (Andrews & Thompson 201 1). 

Similar e* values can be obtained without explicit use of 
the Eddington limit (and the detailed dust properties it en- 
tails), but from the typical L*/M neWj * in young starbursts 
where M ncw * is the mass of the new stars and L* their bolo- 
metric luminosity (<~iiR for the deeply dust-enshrouded star- 
forming sites). For e S F,c=-^new,*/Wnew,* + M*(H 2 )] as the 
star formation efficiency (SFE) of the dense gas regions where 
the new stars form is: 



1 - esF,, 



-Mn fiw . 



(8) 



For e SF c — 0.3-0.5 typical for dense star-forming regions, 
and Lm'/Mnew,* =300-400 (M /L ) (Downes & Solomon 
1998 and references therein), equation 8 yields e*~130- 
400 (M /L ). Here we choose e* =250 (L Q /M Q ), close 
to the average values yielded by equation 8, and the black 
body limit deduced for the compact CO line emission con- 
comitant with an optically thick (rioo^m>l) dust emission 
seen in ULIRGs (Solomon et al. 1997). 

Thus Eddington-limited (i.e. radiation pressure limited) star 
formation in LIRGs sets a near-constant mass normalization 
of the dense star-forming gas phase using the star formation 
driven infrared luminosity and thus allows a Mdcnsc(H2)/SFR 
value to be deduced. This can then be used to set the abso- 
lute scale of the emergent minimal CO SLEDs of the star- 
forming gas phase using the SFR(z) of an input galaxy evolu- 
tion model, since it is 



ico(l-O) 



M d (H 2 )/Xco 
iiR./e*^co 
fc s f r SFR/ e*Xco 



(9) 



which can be converted to solar units using equation 6, and 
with an identical relation applying to the HCN 7(1— >0) line 
luminosity but utilizing the corresponding Xhcn values. 

In Figure 3 we present the observed CO line fluxes pre- 
dicted for both the virial and super-virial models for a 
range of Ltr and redshift; this can be used as a 'ready 
reckoner' to estimate the CO line flux for a given galaxy 
where some estimate of the infrared luminosity is known. 
As noted in §3.1, the LVG solutions for the dense gas 
phase are X co ={3, 9} M (Kkm s _1 pc 2 )~\ and X HCN ={9, 
27} M (Kkms _1 pc 2 ) -1 for the super-virial and virial cases. 
Given the well-excited CO SLEDs expected for the dense star- 
forming gas phase (Figure 1), their normalization to the dense 
gas mass, namely the Xqo factor, determines to a great degree 
the 'visibility' of these SLEDs in the distant Universe. For a 
given amount of dense gas lower Xqo, -^hcn values than 
the adopted ones correspond to brighter CO and HCN SLEDs 
though we consider them unlikely for the self-gravitating or 
only modestly unbound dense star-forming gas in LIRGs. 

The LVG-derived CO line ratios (Table 1) can then be 
used to derive the expected fluxes for the other CO tran- 
sitions from the SFR-normalized ^cofi-ov F° r systems 
with lower metallicities we assume proportionally less dust 
per molecular gas, and thus the Eddington limit becomes 
e* = 250(Z/Z Q )- 1 (Lq/M q ), where Z is the metallicity. 
Finally we note that while the dense HCN-bright gas fuelling 
star formation will be often a small fraction of the total molec- 
ular gas mass in galaxies it is nevertheless the only phase 
for which the frequently-used gas consumption timescale 
7"cons=A/dcnse(H2)/SFR may have its intended physical mean- 
ing as the duration of an observed star formation event. Strong 
star-formation and/or AGN feedback will almost certainly 
modify the 'consumption' timescale especially for the ex- 
tended, less dense, quiescent molecular gas by inducing pow- 
erful outflows (Sakamoto et al. 2009; Feruglio et al. 2010; 
Chung et al. 2011), or by fully dissociating large fractions 
of H 2 gas mass towards warmer and diffuse phases unsuit- 
able for star formation such as Cold Neutral Medium (CNM) 
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• Rodighiero et al. (2010) 




12 3 

Redshift 

Figure 4. Model of the evolution of the infrared luminosity density, fig- 
ure adapted from Figure 12 of Bethermin et al. (2011). Breaking down 
the total contribution into sub-LIRG (L IR < 10 11 L Q ), LIRG (L IR = 
1O 11_12 L0), ULIRG (Lir = 10 12 - 13 L Q ) and HLIRG (L m > 
1O 13 L0) clearly shows how the dominance changes from low-luminosity 
spiral galaxies in the local Universe, through LIRGs at the peak of the lumi- 
nosity density and to ULIRGs at z > 2. We use the model of Bethermin 
et al. as a basis for predicting the number density of star-forming galaxies 
across cosmic time. The points show the measurements of Rodighiero et al. 
(2010) which are based on Herschel PACS observations of the GOODS-N 
field. This figure also illustrates the basic form of the model: rapid evolution 
in both space density and characteristic luminosity to z ~ 0.9, followed by 
two phases of negative evolution to high-z. 

and Warm Neutral Medium (WNM) Hi gas (Pelupessy et al. 
2006). 

Lower metallicities will not have any effect on the emergent 
CO SLED shapes of the dense star forming gas phase in galax- 
ies as long as the optical depths of the CO lines remain signifi- 
cant. Nevertheless the CO-bright part of individual H2 clouds 
will shrink, leaving behind [Cll] and [C r]( 3 Pi-» 3 P ) bright 
H 2 gas (Bolatto et al. 1999 [Figure 1]; Pak et al. 1998). Thus 
the expected first order effect of lower metallicities would be 
to lower all CO line luminosities per H2 mass, while boosting 
up the [C II] and [C l]( 3 Pi— S> 3 P ) line luminosities. The latter 
may be crucial in the detection of metal-poor galaxies at high 
redshifts (see section 3.3.1). 

3.6. [C l]( 3 Pi — > 3 Pq) and HCN lines: a new promising 
avenue towards total gas mass and the star formation 
mode 

The submillimeter lines of atomic carbon, and especially 
[Cr]( 3 Pi— >- 3 P )at 492 GHz have been proposed as power- 
ful alternatives to the CO7(1^0)or CO 7(2^1) lines as to- 
tal molecular gas mass tracers (Papadopoulos et al. 2004), 
and have been shown to work to that effect in local LIRGs 
(Papadopoulos & Greve 2004). The [C l]( 3 Pi-> 3 P ) line 
may actually be a better tracer of total molecular gas 
mass than CO /(l-s-0) (Papadopoulos et al. 2004), taking 
advantage of the positive ^-correction with respect to the 
CO 7(1^0) line (while the CO 7(4— »3) line at similar fre- 
quency to [C l]( 3 Pi— > 3 Po) is a poor total molecular gas mass 



tracer as it is tied to the star-forming gas phase). More- 
over [C l]( 3 Pi— > 3 Pq) is optically thin (thus it does not need 
'X' factors to trace mass), has a simple partition func- 
tion, is easily excitable for the bulk of the H2 gas mass 
in galaxies, and of course remains accessible to ALMA 
for a much wider range of redshifts than CO 7(1— >0) or 
C07(2— >1) (Figure 2). When combined with a tracer of 
the dense, star-forming gas phase, combinations of, e.g. 
HCN 7(1^0) and [C l]( 3 Pi-> 3 P ) could be a powerful tracer 
of the star formation 'mode' of a galaxy, as traced by 
A7densc(H2)/Aitotai(H2). We discuss the prospect of prac- 
tical surveys of the star formation mode in an accompanying 
Paper II (Papadopoulos & Geach 2012). 

4. GALAXY NUMBER COUNTS MODEL 

In order to use the line emission model to predict the num- 
ber counts in a blind molecular line survey, we require a 
framework for the evolution of the abundance of galaxies as a 
function of infrared luminosities (i.e. the SFR history traced 
by the evolution of the infrared luminosity density). This 
can then be used to derive the corresponding evolution of 
A7densc(H2), and thus the emergent molecular line emission. 

We use the 'backwards evolution' parametric model of 
Bethermin et al. (201 1), which is a phenomenological model 
of the evolution of the bolometric luminosity function, based 
on the philosophy of Lagache et al. (2003), but updated 
to fit the latest empirical results from large area infrared 
and sub-mm surveys, including recent results from Herschel. 
Bethermin et al. (2011) employ a three-step evolution of the 
IR luminosity function, which evolves as (1 + zp in 0* and 
Z/*, with two redshift breaks (at z = 0.89 and z = 2) where 
the exponents of the evolution coefficients change. In sum- 
mary, the model can be described as a steep rise in the IR 
luminosity density out to the first break, followed by a rapid 
negative evolution to z — 2 (i.e. there is a 'burst' of activity 
at z ~ 1-2), followed by weak negative evolution to high-z. 
The model is only empirically constrained to z ~ 3, and so 
we caution the reader that extrapolations to higher redshifts, 
as we present here, are inherently uncertain. We examine the 
impact of an order-of-magnitude change in the infrared lu- 
minosity density evolution at z > 3 on our predicted counts 
in §5.1.1, and we acknowledge that any predictions we make 
for the abundance of gas reservoirs at very high redshifts are 
rather speculative; this was, in part, our motivation for a 'min- 
imal' ISM model for the number counts. 

The infrared luminosity density evolution model is suc- 
cessful at re-producing the differential counts across the full 
range of typical bandpasses, 24-850^m, as well as the ob- 
served evolution of the infrared luminosity function. The 
evolution of the infrared luminosity density broken into con- 
tributions from sub-LIRG (L m < 1O U L ), LIRG (L m = 
10 n - 12 L Q ), ULIRG (L m = 1O 12 - 13 £ ) and HLIRG 
(Lm > lO 13 ^©) classes is shown in Figure 4, compared to 
the latest measurements of the total infrared luminosity den- 
sity from Herschel (Rodighiero et al. 2010). Beyond z ~ 1 
the model slightly under-predicts the luminosity density (al- 
though the observational constraints become more uncertain 
at this epoch), indicating that the model could be considered 
a conservative estimate of the galaxy counts at high-z. 

5. RESULTS 

5.1. Minimal integral number counts in a line flux limited 

survey 
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Figure 5. Counts of galaxies with CO J up < 10 (solid lines, blue/thick to red/thin for increasing J), HCN J up < 5 (dashed lines blue/thick to red/thin for 
increasing J), [C II] (green line) and i*i; ne >10 — 24 -10 — 21 Wm — 2 as a function of observing frequency for the virial (top row) and super-virial SLED models 
(bottom row). We calculated the integrated counts for each line in a sliding 8 GHz wide window, and therefore quote the counts as the average per unit bandpass. 
The sharp cut-offs at the highest frequencies of each line correspond to z = 0, and the predictions terminate at frequencies corresponding to z = 10 in this 
figure. The distinctive shape of the counts when presented in this way is simply a reflection of the form of the evolution of the IR luminosity function, which has 
a peak of activity at z ~ 1-2, followed by weak negative evolution to high-z (see §4). 



The model of infrared number counts provides the frame- 
work on which to predict the number counts of molecu- 
lar lines emitted by star-forming galaxies, since Lir oc 
Afdcnso(H2). We apply the minimal emergent model of CO 
emission presented in §3 to predict Lir for a given line of 
integrated flux Fu ne seen at observed frequency v^. The 
L\sXz) model can then be integrated to predict the lower limit 
to the number counts of objects detected in a line flux limited 
survey across the full mm-cm regime. 

In Figure 5 we plot the minimal number counts across 
20 < ^ obs < 1200 GHz for [Cll] , HCN and CO J up < 10 
line emitting galaxies with F\[ nc > 10~ 24 -10~ 21 Wm~ 2 , av- 
eraged over an 8 GHz sliding window. Care will have to be 
taken to correctly identify single lines; the most robust strat- 
egy would be to detect two or more molecular lines for an 
accurate redshift identification. The shape of the counts of the 
individual lines reflects the evolution of the underlying lumi- 
nosity density described in §4. 

This burst of activity at z ~ 1 results in a corresponding 
peak in the line counts, which is probably artificially sharp in 
the present model; in reality we might expect a smoother turn- 
over (this can be mimicked by averaging the counts over a 
wider bandwidth). Nevertheless, this demonstrates the power 
a molecular line survey could have as a probe of the evolu- 
tion of the galaxy population, as the yield of line emitters 
in a spectral survey over sufficiently wide frequency range 
(> 100 GHz) will be a clean and sensitive tracer of the history 
of galaxy evolution that is complementary to previous studies 
that have focused on the evolution of the SFR and stellar mass 
functions. 

It will not be possible to fully sample the frequency range 
shown in Figure 2. Line searches from the ground are strictly 
restricted to the frequency windows dictated by the atmo- 



spheric transmission (the most significant for evolutionary 
surveys being the telluric feature at ^60 GHz), and the avail- 
ability of receivers capable of detecting the radiation transmit- 
ted through such windows. In the following sections we ex- 
amine the number counts expected in practical blind surveys 
conducted by the latest arrays with the sensitivity capable of 
performing them: ALMA, SKA and its pathfinder MeerKAT. 
JVLA will not be powerful enough to perform a blind survey, 
although we examine the number counts in the standard radio 
bands. We consider the detection-rate of molecular emission 
lines in two types of observing campaigns: 

1. Individual pointings of one hour integrations search- 
ing for > 5<7 detections of lines in 300 kms -1 (Av = 
1//10 3 GHz) bins (i.e. an integrated line detection where 
information about the line profile is discarded) across 
a set bandwidth. This strategy could be used for a 
controlled (i.e. flux limited) survey for a particular line 
species. 

2. Individual pointings that reach a line flux limit corre- 
sponding to the the knee of the integrated counts where 
A^(> F) oc F~ 2 , thus optimizing the detection rate 
of sources for a fixed observing time, again for detec- 
tions in 300 km s -1 channels (see Blain et al. 2000 and 
Carilli & Blain 2002). This strategy could be used for 
a simple 'redshift search', where one aims to simply 
identify large numbers of galaxies at high-z, or identify 
unique classes of object that are unlikely to be found by 
any other means. 

We note that that may be other routes to blind redshift sur- 
veys; for example, sensitive Fourier Transform Spectrometers 
(FTSs) that take advantage of the large fields of view of future 
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Table 2 

Predicted number counts of molecular emission lines 



Band 


u obs FoV a rms c 


log Fl hour 


logF op t N(>F lhoul ) 


N(>F opt ) 


lhr rate 


Optimal rate 


Serendipitous 




GHz arcmin 2 fi]y ^/hr 


Wm" 2 


Wm -2 deg - 2 


deg -2 


hour -1 


hour - 1 


10hr c 



Virial SLED model 



MeerKAT b 


to 


55.44 


38 


-22.7 


-23.4 


<1 


92 


0.011 


0.046 


0.39 


SKA K b 


18 


11.16 


1 


-23.8 


-22.8 


2215 


122 


6.867 


35.282 


14.93 


ALMA Band 3 


103 


0.83 


77 


-21.4 


-21.5 


174 


231 


0.040 


0.041 


0.28 


ALMA Band 4 


147 


0.40 


77 


-21.2 


-20.0 


629 


33 


0.070 


1.099 


0.24 


ALMA Band 5 


163 


0.24 


86 


-21.2 


-20.0 


761 


69 


0.050 


0.881 


0.15 


ALMA Band 6 


212 


0.14 


75 


-21.1 


-21.1 


946 


1033 


0.037 


0.034 


0.10 


ALMA Band 7 


278 


0.08 


87 


-20.9 


-21.0 


911 


1137 


0.020 


0.015 


0.05 


ALMA Band 8 


406 


0.04 


288 


-20.2 


-21.0 


383 


1216 








ALMA Band 9 


668 


0.02 


721 


-19.6 


-21.0 


265 


1026 








ALMA Band 10 


854 


0.01 


1526 


-19.2 


-20.0 


357 


922 









Super-virial SLED model 



MeerKAT b 


10 


55.44 


38 


-22.7 


-23.3 


1 


91 


0.029 


0.079 


0.75 


SKA K b 


18 


11.16 


1 


-23.8 


-22.7 


3164 


135 


9.811 


70.894 


21.34 


ALMA Band 3 


103 


0.83 


77 


-21.4 


-20.1 


623 


<1 


0.144 


0.040 


0.66 


ALMA Band 4 


147 


0.40 


77 


-21.2 


-20.0 


1341 


34 


0.150 


1.122 


0.46 


ALMA Band 5 


163 


0.24 


86 


-21.2 


-20.0 


1730 


74 


0.115 


0.946 


0.31 


ALMA Band 6 


212 


0.14 


75 


-21.1 


-20.5 


2195 


483 


0.086 


0.336 


0.20 


ALMA Band 7 


278 


0.08 


87 


-20.9 


-20.3 


2176 


576 


0.048 


0.181 


0.11 


ALMA Band 8 


406 


0.04 


288 


-20.2 


-20.3 


615 


703 


0.007 


0.006 


0.02 


ALMA Band 9 


668 


0.02 


721 


-19.6 


-20.2 


283 


566 








ALMA Band 10 


854 


0.01 


1526 


-19.2 


-20.0 


359 


940 









a assuming field-of-view is equivalent to full width at half maximum of primary beam 
b counts per 4 GHz window 
c ltr noise in 300 km s -1 channel 

d time required to detect integrated 300 km s -1 line at 5<r 

e number of sources detected at >5cr in 300 km s" 1 channels in a 10 hour integration, per field-of-view in a 4 or 8 GHz bandwidth 
'. . . ' indicate the counts are effectively zero for practical purposes (<10 — 3 hr — x ) 



submm telecopes (e.g. CCAT potentially has a usable field-of- 
view of 1°). These could also be used for blind tomographic 
surveys of [C II] in the submm atmospheric windows (Figure 
2) in a similar manner to narrowband surveys of emission line 
galaxies in the OIR bands. 

5.1.1. Atacama Large Millimeter Array 

ALMA covers the frequency range 80-900 GHz, which will 
possibly be extended down to 40 GHz with the addition of 
Bands 1 and 2, which would allow coverage of low- J CO 
and HCN transitions beyond z ~ 1 (Figure 2). At comple- 
tion, ALMA will consist of ^50 12 m single feed dishes (on 
baselines of up to 15 km), and we consider this as our model 
of 'full power' ALMA. The field-of-view (as defined by the 
FWHM of the primary beam) across Bands 3-10 is ~0.01- 
0.8 arcmin 2 , scaling as v~ 2 and the ALMA receivers can ob- 
serve up to 8 GHz of instantaneous bandwidth (dual polariza- 
tion), made up of two 4 GHz-wide side-bands, separated by a 
gap of 8 GHz. In this work, for simplicity, we treat the instan- 
taneous bandwidth as a continuous block. 

The typical observing frequency (chosen to be close to the 
peak atmospheric transmission in that band), field-of-view 
and sensitivity (in 300 km s -1 channels) for Bands 3-10 is 
given in Table 2. For the latter, we assume full-power ALMA 
and average weather conditions^. The limiting (5a) line fluxes 

4 for the sensitivity estimates, we have used the latest version 



for both the 1 hour single-shot and optimal integration modes 
are given and for each case we list the total surface den- 
sity of CO J up < 10, HCN J up < 5 and [C II] line emit- 
ters in an 8 GHz band. The detection rates are given by the 
time required to reach the corresponding ler channel sensi- 
tivity, combined with the field-of-view. Two versions of the 
counts are given, corresponding to the 'virial' and 'super- 
virial' models for the dense gas phase (§3.1). Both models 
assume £sf = 0.25, a conservative estimate for the star for- 
mation 'mode' and thus the contribution of the quiescent gas 
phase (§3.2). 

Figure 6 and 7 show the integral counts iV(> -Fh no ), split 
into the individal line species for the two models, clearly 
demonstrating the progression toward high- J/low-z lines with 
increasing frequency, and the dominance of [C II] in the num- 
ber counts in ALMA Bands 4-10 (cut-off at lower frequencies 
as the line moves to very high redshifts). Figures 8 and 9 show 
the relative distribution of line species that would be detected 
when operating at the optimal flux limit, again highlighting 
the dominance of [C II] in Bands 4-10, corresponding to red- 
shifts of 0.5 < z < 10. As expected, HCN forms a negligi- 
ble contribution to the line counts. Due to the dominance of 
[C II] the virial and super-virial models predict similar yields 
of line emitters in the ALMA Bands, but there are subtle dif- 
ferences in the molecular line yields, as shown in Figures 8 

of the ALMA sensitivity calculator: https://almascience.nrao.edu/call-for- 
proposals/sensitivity-calculator 
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Figure 6. Integrated number counts in our virial SLED model of CO J up < 10, HCN J up < 5 and [C II] lines in 8 GHz wide bands at the central frequencies of 
the submm-cm bands (although Bands 1 and 2 are currently hypothetical, pending development). The dashed vertical line indicates where the slope of the counts 
rolls through N oc F~ 2 , thus indicating the flux limit corresponding to an 'optimal' survey at a given frequency (Blain et al. 2000) and the vertical dotted line 
indicates the 5<r depth for a 1 hour integration (this is unknown for ALMA Band 1 and 2). 



& 9, and we again note that in reality the detection rate of 
CO lines will be higher, due to the minimalist approach we 
take. Similarly, other bright far-infrared lines of nitrogen and 
oxygen will contribute (Coppin et al. 2012 in prep). 

We re-iterate that these counts should be considered lower 
limits to the expected yield in a blind survey. Neverthe- 
less - as is not surprising - even when operating at the op- 
timal flux limit which should give the most efficient detec- 
tion rate for a fixed observing time, the small fields-of-view 
at all ALMA frequencies imply significant observational in- 
vestments would be required to perform efficient blind molec- 
ular line surveys using ALMA alone. Clearly the high- 
frequency Bands >7 are impractical for blind surveys, given 



the precipitous decline in field-of-view and sensitivity beyond 
300 GHz. Nevertheless, although challenging, blind (optimal) 
surveys in Band 4 and 5 could be useful for detecting very 
high-z galaxies close to the epoch of re-ionization via their 
[C II] emission, and intermediate redshift CO emitters. 

As an example of the potential reward of a practical observ- 
ing campaign, consider a 100 hour survey at a fixed frequency 
tuning in ALMA Band 4. Operating at the optimal depth limit, 
this survey would yield ^100 [C II] emitters (which would be 
ULIRG/HLIRG-class galaxies at the corresponding line flux 
limit) at z ~ 12. This is probably optimistic; it goes with- 
out saying that, at this high redshift, our model of the in- 
frared luminosity density is highly uncertain - it is almost cer- 
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tainly overestimated because it simply extrapolates the slow 
decline in space density and characteristic luminosity in the 
number counts model. Indeed we currently know of no galax- 
ies beyond z > 10, and so there are literally no constraints, 
other than what can be gleaned from the shape of the far- 
infrared background (which the Bethermin et al. model suc- 
cessfully re-produces). Therefore, the observed abundance of 
[C II] emitters close to the epoch of re-ionisation could be an 
extremely valuable probe of the history of re-ionisation. As an 
example of the impact of the form of the early evolution of the 
infrared luminosity density on the detection rate, if we modify 
the evolution of the Bethermin et al. model such that infrared 
luminosity density at z — 10 is an order of magnitude lower 
(~10 7 Lq Mpc -3 , cf. Figure 4), then the same 'optimal' blind 
survey would expect detections at a rate of one galaxy per ten 
hours of observation in Band 4. 



What are the blind detection prospects for a putative pop- 
ulation of CO-dark, but ircirj/im -boosted metal-poor sys- 
tems we discussed in section §3.3.1? We naively assume that 
the space density evolution of such systems follows that of 
LBGs. For the parameterisation of the evolving co-moving 
number density we use the latest estimates of Bouwens et 
al. (2011), who constrain the rest-frame ultra-violet lumi- 
nosity function at z ~ 7 and z ^ 8 via an application of 
the Lyman Break drop-out technique in very deep Hubble 
Space Telescope infrared and optical imaging. Modelling the 
LBG luminosity function as a Schechter function, Bouwens 
et al. (2011) fit linear evolutions of the characteristic lumi- 
nosity M*, density normalisation 0* and faint end slope a 
that maximize the likelihood of re-producing the formal lu- 
minosity function fits at z ~ 4, 5, 6, 7 and 8. We take 
this model and assume log (Lir/Lq) = log (Lboi/£©) = 
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11.67 - 0.58(Mtj V ,ab + 21) (Bouwens et al. 2009) to esti- 
mate the co-moving space density of [C II] -bright galaxies in 
the ALMA bands. Applying the same optimal survey strategy 
as above, this model suggests that a blind survey in Bands 4-6 
would detect ^2-7 galaxies per hour at redshifts of z ~ 8-12. 
Again, we caution that this estimate relies on extrapolation 
of the evolution of the LBG luminosity function beyond cur- 
rent observational constraints, and the high-z LBG luminosity 
function might not necessarily reflect that of a population of 
metal-poor galaxies in the £[cti]/£ir = 10~ 2 class. How- 
ever, again, this highlights the discovery potential for blind 
surveys at submm-to-mm wavelengths. 
More realistically, ALMA will be routinely used in synergy 



with wide-field sub-millimeter and radio continuum surveys 
(JCMT/SCUBA-2, LMT, CCAT, MeerKAT, ASKAP, etc.), 
which will detect star-forming galaxies out to z ~ 10 (taking 
advantage of the negative ^-correction in the sub-mm bands 
for instance) that can be targeted for redshift identification. 
In this case our model can be used to estimate the minimum 
line flux (and therefore exposure time required) for redshift 
searches as a function of frequency (e.g. Figure 3). In many 
cases the required flux limits can be achieved in a matter of 
minutes with full-power ALMA, although a scan in frequency 
will be required for the detection of several emission lines. 
Wide bandwidth submm direct detection spectrographs (e.g. 
Z-Spectrometer) might be more practical for such a targeted 
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Figure 9. Break-down of line species detected at the optimal flux limit of each band in the super-virial model (Fig. 7). Caption as Figure i 

data. 



search, where several bright far-infrared and CO lines could 
be detected simultaneously (Figure 2). 

Finally, we note that an alternative application of this model 
is to predict the number of serendipitous line detections in 
routine deep (e.g. ^10hr) ALMA observations. Indeed, we 
can ask whether such data-cubes could be 'harvested' for line 
emitters in a semi-blind sense; exploiting the more common- 
place deep, pointed observations of some extragalactic source. 
In Table 2 we list the number of serendipitous detections 
that would be expected per field-of-view in a 10 hour inte- 
gration in each band. In Band 3, one would expect >0.3-0.7 
serendipitously detected sources per 10 hour cube, with detec- 
tion rates naturally falling off with increasing frequency, and 
declining sensitivity and field-of-view. Therefore, long after 
ALMA comes into full operation, one could envision hunting 
for serendipitously detected high-z line emitters in archival 



5.1.2. Square Kilometer Array and MeerKAT 

Phase 3 of the SKA will culminate in an array of 1250 
15 m single-feed dishes; the current design plan is to include 
high-band coverage to ^30 GHz, where the sensitivity will 
be ~14^iJy y/min (in a 30 MHz [300 kms -1 ] bin, dual po- 
larization; S. Rawlings, 2011, private communication). The 
field-of-view at 30 GHz will be ^7 square arcminutes. How- 
ever, SKA's high-frequency capability will only be available 
towards the completion of the telescope, which is likely to be 
towards the end of the next decade (and is therefore subject 
to design change); construction will commence with the low 
frequency receivers. 

One a shorter timescale, one of SKA's main pathfinders, 
the Extended Karoo Array Telescope (MeerKAT) will have 
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a highland receiver covering 8-15 GHz (again, to be con- 
structed in Phase 3 of the project. The other main SKA 
pathfinder located in Western Australia - ASKAP - will not 
cover frequencies beyond 2 GHz and is therefore unsuitable 
for molecular surveys). MeerKAT will consist of 64 13.5 m 
single-feed dishes on baselines up to 20 km. As shown in Fig- 
ure 2, SKA/MeerKAT operating in the radio K/Ka bands will 
be sensitive to CO /(l— >-0) at z > 3, and thus will be vital for 
discovering the molecular gas reservoirs fuelling galaxies in 
the very early Universe, close to the epoch of re-ionization, 
z ~ 6-10 (see Heywood et al. 201 1). 

In Table 2 we present the flux limits and number counts 
for the two observing strategies, assuming 4 GHz windows 
(MeerKAT and SKA are not likely to have receivers capa- 
ble of observing more than 4 GHz of bandwidth). The power 
of SKAs incredible sensitivity is clear here; in the K band 
even our conservative estimates predict that ~30-70 z > 3 
CO/(1^0)-CO/(3^2)line emitters could be detected ev- 
ery hour; at the optimal limit, the galaxies would be in the 
ULIRG luminosity class (again CO emitters dominate the de- 
tections). Figures 6-9 show the integral counts and line dis- 
tribution, which shows how SKA will open-up the z > 3 Uni- 
verse to low-/ CO exploration. 

There is a clear synergy with ALMA here, since SKA will 
not be able to measure the mid-to-high-7 line emission in 
the galaxies it detects. In this case, pointed observations of 
SKA detections with ALMA would be the natural way to ob- 
tain robust redshifts and allow construction of the SLED of 
these high-z galaxies. Armed with at least two lines for each 
galaxy, each tracing different components of the gas reservoir, 
it would be possible to perform another important survey - the 
star formation 'mode' of galaxies. This is discussed in more 
detail in a follow-up work, Papadopoulos & Geach (2012, Pa- 
per II). 

6. SEMI-BLIND REDSHIFT SURVEYS 

Even when ALMA is at full capacity, blind molecular line 
searches will require risky observational investments, albeit 
with the potential for rich reward. Nevertheless, blind red shift 
surveys with ALMA are not out of the realms of possibility, 
and the reward for such an endeavour could be vast. Blind, 
high-redshift low-/ CO surveys with SKA will be highly prac- 
tical, given the sensitivity of the instrument and the reason- 
able field-of-view. Nevertheless, future redshift surveys in the 
submm-to-cm regime will likely target large samples of galax- 
ies pre-selected by their submm or radio continuum emission; 
we call these semi-blind redshift surveys. 

Our emergent model predicts the minimum CO, HCN and 
[C II] line flux for a galaxy with a given Lir (Figure 3), and 
can therefore be used to help design a semi-blind redshift sur- 
vey by providing conservative estimates for the exposure time 
required to detect a galaxy with some estimated Lm , A semi- 
blind survey would have to perform a redshift search at the 
position of each targeted galaxy, scanning in frequency until 
several emission lines are detected (the CO ladder is spaced at 
intervals of Ai/ ~115/ (1 + z) GHz for example). Wide-band 
grating spectrometers (e.g. Z-Spectrometer [Bradford et al. 
2004] and ZEUS [Ferkinhoff et al. 2010]) deployed in a multi- 
object capacity on ground-based telescopes will be ideal for 
this purpose in the shorter wavelength submm windows (Fig- 
ure 2). 

Large-area sub-millimeter and radio continuum surveys 
during the next decade will be able to supply thousands of tar- 
gets for such a semi-blind survey. In the short term, SCUBA- 



2 on the JCMT will be mapping ^lOdeg 2 Ss50 ~1.2mJy 
(la) depths, although this limit still corresponds to rather lu- 
minous (^ULIRG-class) systems at z <~ 2. Similarly Her- 
schel has recently mapped large areas of sky in the 250- 
500^m sub-mm bands (Eales et al. 2010; Oliver et al. 2010), 
but these are relatively shallow surveys suffering from signif- 
icant confusion issues that, at redshifts beyond z ~ 1, are 
generally only tracing the most luminous galaxies, and not 
probing into the L* regime. Finally powerful radio galaxies 
at high redshifts provide excellent beacons for such molec- 
ular line semi-blind redshift surveys as they mark the cen- 
ters of deep potential wells where multiple gas-rich systems 
converge, forming the massive galaxy clusters found in the 
present cosmic epoch (e.g. De Breuck et al. 2004; Miley & 
De Breuck 2008). 

In the near future, much larger single dish sub-millimeter 
telescopes such as the LMT and CCAT will perform more sen- 
sitive, very wide area sub-mm surveys, detecting the majority 
of the star formation rate budget out to z <~ 3. Offering sim- 
ilar promise is the imminent advent of all-sky sensitive radio 
surveys during the next decade. For example, another SKA 
pathfinder, the Australian SKA Pathfinder (ASKAP), will be 
performing a 1.3 GHz radio continuum survey called 'EMU' 
(Evolutionary Map of the Universe). This will be mapping 
the entire sky south of S < +30° to rms^lO/iJy sensitivity 
(Norris et al. 201 1), detecting most of the star-forming galax- 
ies that exist out to z ~ 1. One of the most critical aspects 
of semi-blind surveys will be to properly understand the se- 
lection biases arising from a, say, (sub)mm continuum flux 
limited or stellar mass selected sample, highlighting the need 
for a truly blind survey. 

7. SUMMARY: FORTUNE FAVOURS THE BRAVE 

We have presented a conservative model of the number 
counts of galaxies detected in a blind molecular line survey in 
the sub-mm/mm/cm regime. Our model calculates the 'emer- 
gent' CO, HCN and [Cll] A158^m emission of star-forming 
galaxies, and is rooted in the latest models of star formation 
feedback and empirical data on the HCN SLED (tracing the 
dense gas phase) in local star-forming galaxies. The normal- 
ization of the emergent CO SLED is given by the star forma- 
tion rate, which in this case is taken to be the infrared lumi- 
nosity of a galaxy. Thus, our model describes the minimum 
molecular line emission expected for star-forming galaxies 
based solely on the luminosity of their actively star-forming 
reservoirs. This could be used to design follow-up spectro- 
scopic surveys for an unbiased Lir limited survey. 

Coupled with an up-to-date model for the evolution of the 
infrared luminosity density that successfully re-produces the 
observed number counts of galaxies over a wide range of the 
infrared wavebands (Bethermin et al. 2011), we make pre- 
dictions of the lower limit of integrated number counts of 
line-emitting galaxies across a range of observed frequencies 
and bandpasses pertinent to the main facilities capable of per- 
forming a molecular redshift survey (ALMA, SKA and its 
pathfinders). We consider ambitious blind redshift surveys, 
working at the optimal flux limit set by the predicted knee in 
the galaxy number counts, and discarding information about 
the shape of the spectral line (i.e. binning to a spectral res- 
olution of 1000, i.e. ^300 km s -1 ). Such blind surveys can 
reveal insight into: 

The epoch of re-ionization: The sensitive ALMA bands 
could potentially detect ULIRG-class [C II] emitters close to 
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the epoch of re-ionisation, z > 10, at a rate of up to one 
per hour (although this is highly sensitive to the star for- 
mation history of the Universe at this early time). Nev- 
ertheless, should such extreme systems exist at this epoch, 
a blind ALMA survey would be capable of finding them, 
and their abundance would provide valuable insight into the 
star formation and chemical history of the Universe close to 
the era when the first stars ignited. In our minimal model, 
[C II] emitters dominate blind (optimal) surveys with ALMA, 
however mid-7 CO emitters would also be detected at lower 
rates, but with increasing yields for deeper (but sub-optimal) 
surveys. 

CO-dark galaxies: We also examine the possibility of de- 
tecting [C II] luminous, but CO-dark gas reservoirs in metal- 
poor galaxies at high-z with ALMA. Assuming such a pop- 
ulation exists with a similar space density to Lyman Break 
Galaxies, blind surveys with ALMA could detect systems at 
z <~ 8-12 with optimal rates of ^2-7 per hour. 

Efficient blind surveys of low-J CO emitters at z > 3: 
The SKA will represent a sea-change in the sensitivity of 
radio/cm-wave surveys, with SKA Phase 3 (offering access 
to the radio K band) providing access to low-/ CO emission 
at z > 3. We predict that an optimal redshift survey could de- 
tect ^30-70 ULIRG-class CO emitters per hour. While our 
model is based on the abundance of star-forming galaxies, 
blind SKA surveys could also detect outliers from the stan- 
dard Schmidt-Kennicutt relation. In a follow-up work, Paper 
II (Papadopoulos & Geach 2012), we consider the detectabil- 
ity of 'pre-starburst' galaxies, representing a brief gas-rich 
phase preceding the onset of an episode of intense star for- 
mation where the host galaxy is extremely difficult to detect 
in any other waveband. 

The coming decade and the years beyond will be an excit- 
ing time for extragalactic astronomy: we will routinely de- 
tect molecular emission from high-redshift galaxies, break- 
ing through the sensitivity floor that has limited the major- 
ity of current studies to the most luminous or fortuitously 
gravitationally lensed galaxies. This work presents a simple, 
empirically-based model to aid in the design of redshift sur- 
veys (both blind and semi-blind). Although we promote am- 
bitious observations, with - arguably - speculative results, we 
are motivated by the rich spoils: totally new and, in some 
cases, unique insights into the physics of galaxy formation 
that could be the reward for such efforts. 
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